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ABSTRACT 

The  information  transfer  across  the  battlespace  is  expanding  at  an  ever  increasing  pace.  Simulations  have 
to  be  carried  out  to  ensure  the  quality  of  service  of  the  different  applications  (sensors,  command  and  control 
systems,  voice  communications,  . . . )  all  integrated  in  a  dynamic  network  environment.  Two  approaches 
are  common:  discrete  event  simulation  and  fluid  approximation.  A  discrete  event  simulation  generates  a 
huge  amount  of  events  for  a  full-blown  battlefield  communication  network  resulting  in  a  very  long  run¬ 
time.  The  usability  of  this  simulation  technique  is  very  limited  for  a  rapid  changing  combat  theater.  A 
faster  fluid  based  approximation  lacks  the  accuracy  wanted  for  a  realistic  simulation.  A  hybrid  simulator 
separates  the  traffic  in  two  classes.  The  packets  of  the  foreground  traffic,  for  which  fine-grained  performance 
details  are  needed,  are  simulated  by  an  event  driven  approach,  while  the  background  traffic,  for  which 
less  detailed  information  is  required,  is  approximated  by  a  fluid  model.  A  research  area  of  this  hybrid 
approach  is  the  characterization  of  the  fluid  dataflow.  Classical  statistics  are  inappropriate  for  real  network 
data  due  to  the  heavy-tailed  behaviour  and  the  self-similarity  of  network  traffic.  This  paper  discusses  the 
estimates  based  on  the  Large  Deviations  asymptotic,  the  Central  Limit  Theorem  and  a  range  of  scaling  laws 
in  between  both  limits,  a  so  called  Moderate  Deviations  approximation.  Based  on  real  network  traces  the 
loss  probability  of  a  data  stream  can  be  estimated  for  all  models.  The  results  of  this  protocol  independent 
analysis  can  be  incorporated  in  a  discrete-event  simulation  allowing  a  considerable  reduction  of  computation 
time  and  memory  consumption.  The  desired  fine-grained  details  of  the  foreground  application  behaviour 
can  be  obtained  while  taking  into  account  the  background  traffic.  In  this  article  different  characterization 
methods  of  background  traffic  in  a  hybrid  network  simulation  are  presented  and  the  results  are  compared 
with  discrete-event  simulations. 


1.0  INTRODUCTION 

The  basic  events  for  most  packet-event  network  simulations  are  the  arrivals  of  packets  in  a  network  node  and 
the  departure  of  packets  on  a  carrier.  The  model  of  the  network  consists  of  queues  having  a  limited  buffer  size 
and  links  characterized  by  their  transmission  capacity.  A  high  accuracy  is  gained  but  the  simulation  becomes 
very  slow  due  to  the  computational  cost  when  many  events  are  generated.  A  large  amount  of  events  can  be 
generated  for  high  bandwidth  links  or  for  networks  with  a  great  number  of  devices.  A  parallel  discrete-event 
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simulation  can  increase  the  scalability  of  the  size  of  a  network  but  can  not  accelerate  the  simulation  of  a 
high  capacity  link  due  the  sequential  nature  of  a  traffic  flow  on  a  link.  The  technique  of  events  generated  by 
packets  has  to  be  abandoned  if  a  higher  simulation  speed  is  required. 

Fluid  based  approaches  for  estimating  the  behaviour  of  network  traffic  can  be  used  to  simplify  the  simulation. 
However  the  level  of  detail  available  to  the  approximation  is  less  and  accuracy  is  sacrificed  for  a  faster  run¬ 
time.  Subtle  protocol  dynamics  can  not  be  studied  using  this  technique.  Network  emulation  systems  that 
interact  with  real  applications  running  on  real  networks,  are  also  out  of  reach. 

The  combination  of  discrete-event  simulations  and  fluid  approximations  can  be  an  answer  to  the  challenge  of 
getting  packet  level  details  in  a  reasonable  amount  of  time.  In  the  majority  of  the  scenarios,  the  simulation  is 
used  to  get  the  performance  of  a  specific  application  when  the  data  from  that  application  is  multiplexed  over 
the  network.  A  hybrid  simulator  separates  the  traffic  in  two  classes.  The  traffic  flows  which  need  the  full 
packet  information,  the  foreground  traffic,  are  simulated  by  an  event-driven  approach,  while  the  background 
traffic,  for  which  less  detailed  information  is  required,  are  approximated  by  a  fluid-flow  model. 

The  amount  of  abstraction  introduced  by  the  fluid  model  determines  the  precision  of  the  calculation.  There 
are  many  papers,  i.e.  [3],  considering  the  transport  layer  behaviour  of  traffic  streams  as  a  basic  approximation 
of  the  data  flow  in  a  real  network.  The  flow  conservation  laws  and  the  congestion  control  techniques  used 
in  TCP  allow  to  write  down  a  system  of  equations  describing  the  traffic  behaviour.  The  traffic  source  has  a 
limited  number  of  parameters  that  have  to  be  estimated  to  get  a  realistic  behaviour  of  a  specific  application 
on  a  network  and  this  has  to  be  done  for  each  source  putting  data  into  the  network.  In  the  case  the  individual 
traffic  streams  are  not  known  or  too  complex  to  demultiplex,  as  for  a  high  bandwidth  traffic  trace,  or  the 
traffic  flow  can  not  be  modeled  by  the  used  equations,  as  for  circuit  emulated  services  over  Ethernet1,  the 
transport  layer  model  is  not  adequate.  In  this  paper  the  link  layer  behaviour  of  traffic  is  modeled  considering 
the  same  network  elements  as  the  discrete-event  network  simulation,  i.e.  buffers  and  links.  The  influence  of 
the  transport  layer  is  embedded  in  the  analyzed  traffic  trace.  The  exact  behaviour  of  the  background  traffic  is 
secondary  to  how  it  influences  the  foreground  packets.  A  parameter  fitting  of  a  traffic  source  to  a  traffic  trace 
is  not  necessary.  Only  the  interaction  of  the  background  stream  with  the  queue  levels  in  the  event  driven 
simulation  has  to  be  modeled.  This  is  done  by  introducing  the  buffer  flow  probability,  a  one -dimensional 
stochastic  variable  that  describes  the  probability  that  the  amount  of  work  in  the  queue  is  bigger  than  a  certain 
value.  This  technique  evades  the  challenge  of  modeling  the  interaction  between  the  packet  streams  and  the 
fluid  flows. 

Related  work.  The  hybrid  technique  in  which  packet  event  simulation  and  fluid  flows  approximations  are 
combined,  is  a  recent  development.  Some  approaches  use  different  simulators  for  the  foreground  traffic  and 
the  background  stream  [22],  other  separate  the  network  in  packet  level  parts  and  fluid  parts  [13].  There  exist 
two  simulators  [19,  15]  that  integrate  the  Monte-Carlo  simulation  and  the  background  model  into  the  same 
simulator.  Both  use  the  behaviour  of  the  transport  layer  model  to  model  the  fluid  flow.  Traffic  is  simulated 
as  an  incompressible  fluids,  flowing  among  storage  tanks  (the  buffers).  Open-loop  (UDP)  traffic  is  generated 
by  an  Exponential  On/Off  baffle  generator  and  the  closed-loop  (TCP)  source  model  is  a  simplified  version 
of  TCP  Reno.  A  complex,  not  always  well  documented  approach  is  needed  to  synchronize  the  foreground 
traffic  and  the  background  sbeam  and  to  model  the  interaction  of  the  the  fluid  flow  approximation  with  the 
event-driven  packet  simulation  and  vice  versa. 

Literature.  The  approach  in  this  paper  has  inherently  a  many  flows  paradigm,  which  is  quite  acceptable 
for  the  majority  of  the  networks.  A  solution  of  the  basic  queue  equation  for  a  general  traffic  trace  is  im¬ 
possible  [11].  An  asymptotic  approximation  is  the  best  one  can  get.  In  the  literature  two  main  directions 

'The  emulation  of  SDH  circuits  over  Ethernet  or  IP  are  hot  topics  in  the  carrier  world  where  the  Internet  Service  Providers  like 
to  migrate  to  one  link  layer. 
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are  developed.  The  first  considers  the  aggregation  of  many  flows  as  a  Gaussian  process  and  is  based  on  the 
Central  Limit  Theorem  [2,  4],  The  second  estimates  that  the  possibility  of  a  buffer  overflow  is  a  rare  event 
and  uses  the  Large  Deviations  theory  [9,  10,  18,  1 1].  Both  can  have  different  scaling  regimes.  The  most  im¬ 
portant  are  the  fast  time  scaling  and  the  many  sources  scaling  [11].  Fast  time  scaling  looks  at  a  speeded-up 
version  of  the  arrival  process  of  one  basic  flow  whether  the  many  sources  scaling  considers  the  sum  of  N 
independent  copies  of  the  basic  flow.  The  former  gives  bad  results  when  the  buffer  size  is  small  compared 
to  the  transmission  capacity.  This  is  not  acceptable  and  only  the  many  flows  scaling  [25,  11]  is  examined 
for  both  the  Central  Limit  theorem  and  the  Large  Deviations  theory.  Both  limits  have  however  their  strength 
and  their  weakness  in  regard  to  a  traffic  trace  if  no  assumptions  can  be  made  about  the  distributions  of  the 
individual  flows.  Several  attempts  where  made  to  combine  both  and  get  “the  best  of  both  worlds”:  analyti¬ 
cally  tractability  and  minimal  statistical  information  to  be  extracted  from  a  traffic  trace  [26].  The  Maximum 
Variance  Asymptotic  [16,  6,  5,  7],  the  Most  Probable  Path  for  Gaussian  Inputs  [27,  1]  and  the  Moderate 
Deviations  scaling  [11,  26]  answer  this  problem  from  a  different  viewpoint.  After  translating  the  different 
theories  in  one  language  the  same  kind  of  equation  is  found  but  the  application  domain  is  very  different. 

Contribution.  In  this  paper,  a  unified  framework  for  the  queueing  behaviour  of  many  flows  fluid  traffic 
models  is  presented.  These  model  are  all  independent  of  the  higher  layer  mechanics  as  TCP  or  UDP  and 
the  results  are  computed  using  real  traffic  traces.  The  estimates  based  on  the  Large  Deviations  asymptotic, 
the  Central  Limit  theorem  and  the  Moderate  Deviations  limit  are  described  with  the  emphasis  rather  on  the 
key  ideas  behind  the  approximations  than  on  the  rigorous  mathematical  details.  The  latter  can  be  read  in 
the  papers  found  in  the  bibliography.  This  work  gives  an  uniform  approach  to  the  different  estimates  to 
obtain  a  fair  analysis  of  the  accuracy  and  ease  of  computation.  The  appropriate  domains  for  each  method 
is  examined.  The  reference  for  the  benchmarks  is  a  Monte-Carlo  simulation.  Both  synthetic  trace,  periodic 
on-off  sources,  and  real  trace,  the  famous  Bellcore  dataset  [17]  and  the  Star  Wars  MPEG-1  trace  tile  [23], 
are  used.  The  results  are  presented  as  optimal  domains  for  each  technique  based  on  workload,  capacity  and 
buffer  size.  This  is  the  first  systematic  evaluation  of  estimates  for  background  fluid  models  independent  of 
the  higher-layer  mechanics  and  integrating  the  live  behaviour  of  real  traffic  traces. 

Organization.  This  paper  is  organized  as  follows.  In  section  2  the  basic  queueing  model  is  introduced.  An 
estimate  based  on  the  Large  Deviations  (LD)  theory  is  developed  in  section  3  whereas  section  4  details  the 
Central  Limit  Theorem  (CLT)  approximation.  Section  5  gives  an  overview  of  the  different  approaches  to 
Moderate  Deviations  (MD).  All  estimates  are  compared  with  a  discrete  event  simulation  and  the  numerical 
results  are  presented  in  section  6. 


2.0  BASIC  QUEUEING  MODEL 

In  order  to  make  the  discrete  event  simulation  aware  of  the  background  traffic,  the  packet  level  network  traces 
are  translated  in  a  buffer  flow  occupation  probability.  This  queue  indicator  gives  the  probability  that  the 
amount  of  work  in  a  network  device  due  to  the  fluid  data  is  more  than  a  certain  value.  It  can  be  considered  as 
a  stochastic  variable  depending  on  one  parameter.  For  each  network  element,  i.e.  each  queue,  this  probability 
has  to  be  calculated  separately  because  the  capacity  of  the  link,  the  buffer  size  and  the  composition  of  the 
traffic  stream  can  be  different. 

Packet-based  data  networks  are  easily  modeled  by  queueing  systems.  Data  is  parceled  up  into  packets  and 
these  are  sent  over  wires.  At  points  where  several  wires2  meet,  incoming  packets  are  queued  up,  inspected, 

2  A  wireless  network  can  be  regarded  as  one  generalized  processor  sharing  queueing  system  for  all  the  devices  in  transmission 
range.  A  paper  is  being  prepared  to  be  submitted  in  2007  about  an  extension  of  the  theory  to  wireless  base  stations  and  ad-hoc 
networks. 
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and  sent  out  over  the  appropriate  wire.  When  the  total  number  of  traffic  units,  i.e.  bytes,  packets  or  cells, 
reaches  the  buffer  size,  incoming  packets  are  discarded. 

The  basic  FIFO3  queueing  system  can  be  quantified  using  the  modified4  Lindley’s  recursion  [11] 

Qn  =  [Qn-l+An-Cn]B0  (1) 

where  [jc]q  =  max(min(x,,B),0).  Qn  can  be  interpreted  as  the  amount  of  work  in  the  queue  just  after  time 
n  €  Z,  An  as  the  arrival  process,  i.e.  the  number  of  traffic  units  which  arrive  in  the  interval  in  —  I ./?),  Cn  as 
the  capacity  of  the  link,  i.e.  the  number  of  traffic  units  served  between  n  —  1  and  n,  and  B  the  buffer  size  also 
expressed  in  traffic  units. 

To  solve  this  recursion  equation  the  following  assumptions  are  made: 

•  C„  is  independent  of  n; 

•  the  arrival  process  is  stationary,  i.e.  (A_„, . . .  ,Ao)  has  the  same  distribution  as  (A_„_m, . . .  ,A_m)  for 
every  n  and  m; 

•  the  mean  traffic  rate  is  lower  than  the  capacity,  i.e.  E  (A„)  <  Cn\ 

•  the  queue  is  empty  at  time  —  °o. 

The  capacity  of  a  network  link  can  be  considered  to  have  a  constant  value5.  Measurements  of  traffic  traces 
on  real  networks  have  shown  that  real  traffic  can  be  long-range  dependent  [17]  but  is  mostly  stationary.  If 
the  mean  traffic  rate  is  higher  than  the  capacity,  the  queue  will  never  empty  and  the  recursion  is  not  stable. 
The  emptiness  of  the  queue  at  infinity  is  implicit,  i.e.  the  boot  up  of  the  device. 

With  these  constraints,^,,  is  also  stationary  and  its  distribution  is  called  the  steady  state  distribution  of  the 
queue  size 

Q  =  sup  (St  —  Ct)  (2) 

teN 

where  C  is  the  tranmission  capacity,  S,  =  £°=„f+1  A„  and  .S'o  =  0.  The  queue  indicator  is  the  probability  that 
Q  is  larger  than  X 


Pr(<2  >X)  =  Pr  (  sup  (St  -Ct)>X 
\t>  o 


(3) 


Only  in  certain  cases6  this  probability  can  be  calculated  analytically.  For  real  traffic  traces  numerical  ap¬ 
proaches  have  to  be  used  to  find  an  estimation.  All  models  that  are  detailed  in  this  paper,  try  do  find  an  exact 
asymptotic  for  the  buffer  flow  probability. 

To  test  the  accuracy  of  the  approximations  an  event-driven  simulation  of  the  queue  with  the  same  arrival 
process  as  the  fluid  traffic  is  used.  The  result  of  this  simulation  described  in  the  literature  [18]  is  however  not 
the  buffer  overflow  probability  {BOP),  i.e.  the  buffer  flow  probability  with  X  =  B,  but  the  loss  rate  ( LR )  [18] 
of  packets  expressed  in  traffic  units. 

E((Q+A-(C+B)}0) 


LR  = 


E(A) 


(4) 


3  First  In  First  Out 

4The  original  Lindley’s  recursion  Qt  =  (Qt-i  +A,  —  C,)+  is  extended  to  take  into  account  the  limited  buffer  size.  x+  denotes  the 
positive  part  of  x,  i.e.  max[x,0). 

5  In  a  real  network  the  capacity  can  be  variable  but  the  speed  of  change  is  considered  to  be  much  slower  than  the  characteristic 
time-scales  that  determine  the  queueing  behaviour. 

6For  basic  arrival  distributions,  i.e  Poisson  traffic,  the  calculation  can  be  done  but  real  traffic  tends  to  behave  differently  [21], 
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Both  the  BOP  and  the  LR  calculated  for  the  different  approaches  and  they  are  compared  with  the  event  driven 
simulation.  The  BOP  is  obtained  directly  from  an  event  driven  simulation  by  measuring  the  total  time  that 
the  buffer  is  in  overflow  compared  to  the  total  simulation  time. 


3.0  LARGE  DEVIATIONS 

Large  Deviations  theory  has  many  applications  and  can  successfully  be  applied  to  networks.  The  most 
interesting7  limiting  regime  considers  what  happens  when  a  queue  is  shared  by  a  large  number  of  independent 
traffic  flows  (also  called  sources). 

Consider  the  single  server  queue  as  before,  with  N  sources  and  constant  service  rate  C  =  cN.  Let  A„  be  the 
amount  of  work  arriving  from  source  i  at  time  n.  Assume  that  for  each  i,  ,  n  £  Z  j  is  a  stationary  sequence 
of  random  variables,  and  that  these  sequences  are  independent  of  each  other  but  identically  distributed.  This 
can  be  rephrased  in  the  terminology  of  previous  section 

Q  =  sup  (St  —  cNt)  (5) 

teN 

where  An  =  A„\  S,  =  £9=_f+1  A„  and  So  =  0.  So  St  is  the  total  amount  of  work  arriving  at  the  queue  in 

the  interval  ( — f ,  0] . 

The  buffer  flow  probability  can  be  considered  as  a  queue  indicator  in  the  event  driven  simulation 

Pr(£>X)  =Pr(  sup  (S,-cNt)  >xN)  (6) 

VreN  J 

here  is  X  =  xN.  The  principle  of  the  Largest  Term  that  plays  an  important  role  in  general  Large  Deviations 
theory  [11],  can  be  applied  to  the  previous  expression 

Pr(Q>X)=  sup Pr  (St  -  cNt  >  xN)  (7) 

teN 

The  Chernoff ’s  bound  [25]  for  the  upper  bound  and  the  Cramer’s  theorem  [25]  for  the  lower  bound  can  be 
used 


-/(*+)  <  lim  inf/v^oo  ±  log  (Pr(Q  >  A))  (8) 

limsupA,^00Plog(Pr(2  >X))  <-/(*-)  (9) 

where  .S')' 1  ^  =  £°=_r+ 1  a}/  '* ,  sj,1  *  =  0,  a)  I  !(.s-)  =  log  EeiS,<  ' ,  the  cumulant  generating  function8  of  1  ^ ,  a|  1  ^  (x  + 
ct)  =  supigM+  (^s(x  +  ct)  —A<',l\s)Sj ,  the  convex  conjugate9  of  A^(s)  and  I(x)  =  inf(eN ^(x  +  ct),  the  rate 
function.  The  calculation  is  based  on  following  assumptions: 

•  for  all  t.  A,1  (j)  is  finite  for  ,v  in  a  neighborhood  of  the  origin; 

•  A(P(s)  =  lim,  |a)I;(.v)  exist  and  is  finite  and  differentiable,  for  ,s  in  a  neighborhood  of  the  origin. 

7  The  simpler  large  buffer  asymptotic  [11]  can  not  be  used  due  to  the  possible  small  values  of  X. 

8  also  known  as  the  logarithmic  moment  generating  function 
9also  known  as  the  Fenchel-Legendre  transform 
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The  general  ideas  behind  this  formula  can  be  summarized  in  two  principles: 

•  application  of  the  principle  of  the  Largest  Term  to  move  the  sup,  out  of  the  probability; 

•  approximation  of  the  tail  of  the  distribution  with  an  exponential,  based  on  a  large  deviations  result. 

This  shows  that  inaccuracy  can  occur  in  two  ways  [1 1],  The  principle  of  the  Largest  Term  is  very  accurate 
for  a  moderate  traffic  rate  compared  to  the  transmission  capacity  and  provides  a  lower  bound  for  higher 
traffic  rates.  The  exponential  curve  approaches  nicely  the  buffer  flow  probability  for  low  traffic  rates  but 
overestimates  for  traffic  rates  near  the  link  rate. 

As  does  Cramer’s  theorem,  this  result  involves  both  an  upper  and  a  lower  bound.  If  A,  ^(s)  is  continuous  in 
s  for  each  t,  then  the  two  bounds  agree  and  a  straightforward  approximations  can  be  obtained 

Pr(Q  >  X)  «  e-4f(^M%))  (10) 

where  st  =  argsupveR+  (s(b  +  ct)  -  A ,(1)(s))  and  t  =  arginffeN  (s,(b  +  ct)  -  A f(1) 

There  was  assumed  that  f_1A^(s)  converges  to  a  limit  A':  1 !  (.v) ,  and  yet  A:  1  -j  (.v)  does  not  appear  anywhere 
in  the  result.  This  assumption  is  just  a  easy  way  to  control  the  tails  so  that  there  are  no  surprises  in  the 
distribution  of  S,1'1  for  large  t.  This  is  only  needed  to  prove  the  upper  bound  but  not  the  lower  bound.  All  the 
measured  traffic  traces  allow  the  convergence  of  this  limit. 

The  principle  of  the  Largest  Term  is  used  for  the  deduction  of  both  the  upper  and  the  lower  limit.  The 
motivation  behind  this  principle  is  the  idea  that  to  estimate  the  probability  of  a  rare  event,  only  the  most 
likely  path  in  which  this  event  can  occur,  has  to  be  considered.  For  the  lower  bound,  the  fixed  time  t  is 
the  most  likely  duration  starting  from  an  empty  buffer  that  the  workload  in  the  queue  exceeds  the  value 
x.  The  upper  bound  shows  that  the  probability  of  a  workload  exceeding  a  level  x  over  a  time  period  t  ^  t 
is  negligible  on  a  logarithmic  scale.  The  principle  of  the  Largest  Term  can  be  used  to  obtain  directly  a 
numerical  estimate  for  the  buffer  flow  probability  but  the  calculation  for  real  traffic  traces  is  both  memory 
and  processor  intensive. 

The  assumptions  that  the  A,  are  independent  and  identical  are  overly  restrictive.  General  Large  Deviations 
theory  gives  a  more  sophisticated  theorem!  1 1],  with  less  restrictions.  The  queueing  behaviour  of  traffic  with 
a  random  composition  will  be  dominated  by  the  most  bursty  source  over  a  certain  period  t.  Experimental 
setups  have  shown  that  the  buffer  flow  probability  of  real  network  traces  successfully  can  be  approximated 
by  this  theory.  The  knowledge  of  the  number  of  sources  or  the  different  traffic  types  is  not  necessary  [10] 

Pr{Q  >  X)  «  eVfB+cO-AOf)  (l  1) 

where  Af(s)  is  the  cumulant  generating  function  of  the  traffic  mix,  s,  =  argsupi6R+  (s(B  +  Ct)  —At(s))  and 
t  =  arginffeN (st(B  +  Ct)  —  A t(st)).  st  =  argsupigR+  (s(B  +  Ct)  —  A t(s))  can  easily  be  computed  with  the 

method  of  Brent  exploiting  the  concavity  of  A,  in  5  so  that  siB  +  Ct)  —  A  t{s)  is  also  concave  in  s  [8].  A  linear 

search  over  the  discrete  time  intervals  can  be  used  to  minimize  the  equation  in  respect  to  t. 

The  cumulant  generating  function  can  be  transformed  in  a  convenient  and  intuitive  descriptor  of  the  stochas¬ 
tic  properties  of  the  traffic  stream,  called  the  Effective  Bandwidth  (a) 11  (s))  of  the  flow  [14]. 

a(il\s)  =  yA(t1\s)  (12) 
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Suppose  the  optimum  is  attained,  and  the  optimizing  parameters  are  Sf  and  t,  both  strictly  positive.  A  small 
number  of  the  N  flows  can  be  replaced  by  constant-rate  flows  of  rate  A 1  (sj) .  Locally,  at  these 

new  flows  have  the  same  cumulant  generating  function  as  the  flows  they  replace,  so  the  rate  functions  are 
the  same10.  In  other  words,  a  flow  with  cumulant  generating  function  Af  1 ;  has  the  same  effect  at  operating 
point  as  a  flow  of  constant  rate  of 1 :  (s,) .  The  effective  bandwidth  is  additive  for  independent  sources. 

Alternative,  the  effective  bandwidth  can  be  understood  in  terms  of  admission  regions.  Suppose  there  are  mN 
flows  with  effective  bandwidth  a t(s)  and  nN  flows  with  effective  bandwidth  p,(.v).  For  what  values  of  m  and 
n  does  the  system  meet  the  quality  of  service  constraint  y? 

Pr  (Qn  >  Nx)  <  (13) 


The  admissible  region  is 


P|  <  m,n  :  3s  >  0  :  mat(s)  +  n(3*(s)  <  c+  - 

f>0  ^  1 


(14) 


The  effective  bandwidth  gives  the  trade-off  between  flows  of  different  types  [14]. 

Let  A(1)  be  a  typical  input  flow  in  a  queue  fed  by  N  independent  and  identically  distributed  flows  and  served 
at  rate  cN,  and  let  D  1  *  the  corresponding  departure  flow.  At 1 '  is  not  described  via  a  large  deviations  principle, 
neither  is  D( A  Instead  AN  obeys  a  large  deviations  principle  and  so  does  D'v.  The  surprising  result  says  that 
Dn  satisfies  the  same  large  deviations  principle  as  AN .  In  a  large  deviations  sense,  the  characteristics  of  a 
flow  of  traffic  are  not  changed  as  it  passes  through  a  queue.  Consider  now  a  queue  fed  by  many  independent 
flows  of  different  types:  N  flows  like  A 1 1  and  N  flows  like  B  1 Let  A(1)  and  A 1 !  be  typical  outputs.  If 
the  total  mean  arrival  rate  is  less  than  the  transmission  capacity,  then  DN  satisfies  the  same  large  deviations 
principle  as  AN  and  EN  the  same  as  BN,  which  means  that  in  a  heuristic  sense  D  1 '  A 1 1  and  A1  /A1.  By 
considering  the  additive  behaviour  of  the  cumulant  generating  function  it  is  clear  that  A 1 :  and  A 1 J  are 
essentially  independent.  The  contrary  might  be  expected.  For  example,  if  A(  1  *  is  very  bursty  and  B  1  is 
rather  smooth,  on  might  expect  that  D  1 1  be  less  bursty  and  A 1 1  be  less  smooth.  Indeed  in  a  router  with  a 
small  number  of  inputs  this  can  happen.  But  in  the  many  flows  scaling  regime11  it  is  not  the  case.  In  other 
words,  D' 1 :  and  A  1 1  do  not  depend  on  the  traffic  mix  at  the  router  as  long  as  the  queue  empties  regularly  with 
high  probability.  This  is  known  as  decoupling  [24].  In  the  large  deviations  limit,  it  make  sense  to  talk  about 
the  effective  bandwidth  of  a  single  flow  through  a  network  as  long  as  in  each  network  device  the  service  rate 
is  higher  than  the  mean  arrival  rate  and  the  effective  bandwidth  of  the  departure  flow  at  the  last  device  will 
exactly  be  the  same  as  the  effective  bandwidth  of  the  arrival  flow  at  the  first  device.  This  is  nice  property 
allowing  straightforward  traffic  engineering. 

The  time  parameter  t  corresponds  to  the  buffer  busy  period  before  reaching  Nx.  This  is  the  time  scale  for 
buffer  overflow  probability,  Nx  =  Nb.  Smoothing  of  traffic  will  only  be  effective  at  a  time  scale  larger  than 
t  because  only  then  it  affects  the  effective  bandwidth  of  the  traffic  flow  [9],  This  parameter  is  also  important 
for  the  granularity  of  the  samples  from  a  real  traffic  trace.  The  sampling  time  has  to  be  less  than  t  to  have 
an  accurate  estimate  but  a  value  several  orders  less  than  t  will  have  a  big  impact  on  the  performance  of  the 
calculation  of  A t(s).  A  ripple  effect  can  be  seen  in  the  effective  bandwidth  surface  for  periodic  values  of  t 
over  the  full  s-range.  This  indicates  a  periodic  behaviour  of  the  traffic  source  with  a  period  equal  to  the  time 
t  of  the  first  ripple. 

The  space  scale  s  is  a  parameter  for  the  degree  of  multiplexing  [12].  The  more  s  approaches  0  the  multi¬ 
plexing  will  be  more  efficient  and  the  effective  parameter  tends  to  the  mean  rate  of  the  traffic  stream  at  the 

10to  the  first  order  and  under  appropriate  smoothness  conditions 

"Experimental  setups  have  shown  that  already  for  a  rather  small  number  of  flows  N  =  3  for  two  traffic  classes,  the  departure 
flows  are  mostly  decoupled  [24]. 
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appropriate  time  scale.  If  5  becomes  large,  the  flows  will  not  multiplex  very  well  and  the  effective  bandwidth 
will  be  close  to  the  peak  rate  of  the  traffic  stream  at  the  corresponding  time  scale.  For  a  fix  t,  the  effective 
bandwidth  is  strictly  concave  going  from  the  mean  rate  to  the  peak  rate  of  the  traffic  flow  at  the  time  scale  t. 

The  basic  estimate  Pr  ( QN  >  Nx)  <  e  /A  can  be  refined  by  using  the  Bahadur-Rao  theorem  [18].  Conditions 
can  be  found  which  allow  a  tighter  limit  on  the  buffer  flow  probability.  The  result  is  an  exact  asymptotic 
solution  for  the  queueing  system 


Pr  ((f  >  Nx)  « - .  1  q-n,(x) 

SfJlnOfN 


(15) 


where  I(x)  =  inffeN  sup seR+ /,(£,.*)  with  Jt(s,x)  =  s(x  +  ct )  -  £"=1  p'AfW(s), 
flows  from  class  i  and  a?  =  d  ^2’^  |  (t  =  t,s  =  s?) .  st  can  easily  be  found  as  arg 


p'  =  n1  /N  ,  nl  the  number  of 
x  +  ct  =  I-L,  and 


■  d2A^  (  ) 

the  expression  for  a?  simplifies  to  —  L"=i  P( — d'v2~  |  (t  =  t,s  =  sf) .  A  further  simplification  gives  an  ad-hoc 

approximation  [20],  which  is  exact  for  the  case  of  Gaussian  arrival  processes  a?  ~  A  final  expression 
for  the  buffer  flow  probability  can  be  obtained  by  entering  in  equation  (15)  this  result  and  setting  N  =  1 
because  the  number  of  flows  or  the  exact  composition  of  the  flows  are  not  needed 


Pr(<2>X) 


1  e-f(x) 
^4k/(X) 


(16) 


where  I(X )  =  inffeN supseR+  +  Ct)  -  £"=  x  n'A\'\s)^j 

The  loss  rate  can  be  found  for  the  exact  asymptotic  by  using  Petrov’s  Theorem  [18].  Using  the  same  approx¬ 
imation  as  equation  (16),  the  loss  rate  becomes 


LR  « - 1  e~/(g) 


(17) 


where  q  =  E  (At)  is  the  mean  traffic  rate  of  the  aggregated  traffic  stream.  This  final  equation  will  be  the  used 
to  test  the  accuracy  of  the  large  deviations  limit. 

A  trade  off  is  done  between  exactness  and  speed  of  computation: 


•  the  number  of  flows  or  the  exact  traffic  mix  are  not  needed; 

•  the  probability  is  approximated  with  an  exponential; 

•  a  refined  estimate  based  on  the  Bahadur-Rao  theorem  and  Petrov’s  theorem  is  used; 

•  Of(s()  is  approximated  by  supposing  that  the  aggregated  traffic  stream  is  Gaussian; 

•  decoupling  makes  it  easy  to  follow  a  stream  passing  through  a  network; 

•  s  gives  an  indication  how  well  the  traffic  can  be  multiplexed  on  a  link; 

•  the  full  statistical  characteristics  of  a  traffic  flow  have  to  be  known; 

•  a  sampling  procedure  is  needed  to  obtain  an  estimated  effective  bandwidth  or  cumulant  generating 
function  from  a  real  traffic  trace. 
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4.0  CENTRAL  LIMIT 


Heavy  Traffic12  theory  relies  on  the  fact  that  the  amount  of  traffic  in  a  certain  sampling  interval  due  to  the 
aggregation  of  many  flows  onto  a  link  tends  to  approach  a  Gaussian  distribution.  This  is  an  application 
of  the  Central  Limit  theorem  [2]  and  relies  on  the  assumption  that  the  variance  of  the  component  traffic 
streams  is  uniformly  bounded.  Denote  the  component  traffic  streams  in  a  network  by  A„  ,  the  aggregation 
by  An  =  YjLiAn  and  the  cumulative  arrival  process  by  St  =  Y?n=-t+\  A;  where  ,S'o  =  0.  S,  can  be  seen  as  a 
vector  with  index  t  and  every  component  of  this  vector  has  a  Gaussian  distribution  in  respect  to  the  Central 
Limit  theorem 

St  =  *C(nt,oj)  (18) 

where  p  =  E(A)  is  the  mean  traffic  rate  of  the  aggregated  traffic  stream,  E  (.S’,  J  =  pf  and  aj  =  var  (S, ) . 

It  can  be  shown  that  as  the  aggregation  takes  place,  that  virtually  all  performance  measures  of  multiplexers 
and  switches  in  a  network  individually  or  in  combination,  including  loss,  delay  and  jitter,  must  approach 
the  performance  measures  of  a  communication  system  carrying  Gaussian  traffic  with  the  same  second  order 
statistics.  If  sufficient  independent  sources  of  traffic  contribute  to  a  network,  it  is  reasonable  to  analyze  the 
network  using  a  Gaussian  model,  matching  the  second  order  statistics  of  the  model  to  the  real  traffic  data. 

Consider  the  single  server  queue  as  before,  with  constant  service  rate  C  and  the  aggregate  traffic  with  a 
Gaussian  distribution.  This  can  be  formulated  in  the  basic  queueing  model 

Q  =  sup  (St  —  Ct)  (19) 

f€N 

The  buffer  flow  probability  can  be  chosen  as  a  queue  indicator  in  the  event  driven  simulation 


Pr(<2>*) 


Pr  f  sup  (5,  —  Ct)  >X 

\te  N 


Pr  sup 

\«eN 


> 


X  +  (C-ju)t 


'o 7 


(20) 

(21) 


Assuming  that  a2/t2  —>  0  as  t  — >  0,  oj/(X  +  ( C — p)t )2  attains  its  maximum  value  at  some  finite  t  =  t  [5]  and 
using  the  principle  of  the  Largest  Term 


Pr  (Q>X) 


sup 


> 


X  +  (C-ju)t 


/Gt 


fx  +  (c-fi)i\ 

\  i 


(22) 

(23) 


where  'P(w)  =  ^  /“  exp  ( —z2/ 2)  dr,  the  standard  normalized  Gaussian  tail  function,  and  t  =  sup  arg;eN  oj/(X  + 
(C  —  q)f)2.  The  qualitative  statement  “rare  events  happen  only  in  the  most  probably  way”  suggests  that  the 
previous  equation  is  a  good  lower  bound  approximation. 

The  Principle  of  the  Largest  Term  is  used  to  get  this  estimate.  This  seems  at  first  sight  a  contradiction. 
How  can  for  a  queue  filled  with  Heavy  Traffic,  i.e.  the  mean  rate  close  to  the  transmission  capacity,  the 

12  Heavy  Traffic  limit  is  mostly  used  for  the  Fast  Time  Central  Limit  Theory  or  Diffusion  Approximation  which  has  a  rather  bad 
performance  for  a  strong  correlated  source.  In  this  paper  the  term  Heavy  Traffic  is  also  used  for  the  Many  Flows  Central  Limit 
Theory. 
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probability  that  the  work  in  queue  exceeds  the  buffer  size,  be  a  rare  event?  The  answer  is  related  to  the 
space-scale  s  of  the  Large  Deviations  Asymptotic.  A  low  s  means  a  high  multiplexing  gain  and  the  effective 
bandwidth  is  close  to  the  mean  rate.  The  same  phenomenon  happens  in  the  Heavy  Traffic  case.  When  many 
independent  flows  are  aggregated  and  their  mean  is  less  than  the  transmission  capacity,  the  mixture  has  a 
better  performance  than  the  individual  traffic  streams  even  for  a  limited  buffer  size.  The  supremum  time- 
scale  t  is  however  in  the  case  of  a  small  buffer  not  necessary  unique  and  multiple  parallel  paths  to  overflow 
can  exist.  If  the  buffer  size  becomes  0,  a  convenient  approximation  [1]  can  be  used 


/  2  r°°  k-v)2 

Pr (£  >  0)  «  2 Pr (A  >  C)  =  a/  — ^  /  e  S*  dz 

V  no- Jc 

where  q  =  E  (A)  is  the  mean  traffic  rate  of  the  aggregated  traffic  stream  and  a2  =  var  (A)  the  variance  of  the 
aggregated  traffic  stream.  The  heuristic  motivation  that  during  busy  periods,  the  buffer  goes  down  roughly 
as  often  as  it  goes  up,  is  straightforward.  This  can  be  used  to  rescale  the  lower  bound. 

To  ease  the  computation  of  the  integral,  bounds  for  vP(vv)  can  be  used  to  calculate  the  buffer  flow  probability. 


1 —w~ 

— e  2 

y/2iz 


<  'P(w)  < 


1  1  VC 

—p =w  e  2 

V271 


(24) 


The  upper  bound  reappears  in  the  next  session  where  the  rare  event  hypothesis  will  be  quantified. 

Most  other  approaches  calculate  the  limit  in  the  assumption  that  the  base  streams  arc  of  a  specific  kind13 
without  the  principle  of  the  Largest  Term.  The  resulting  equation  is  used  to  estimate  the  behaviour  of  mea¬ 
sured  traffic.  The  presented  results  are  mostly  very  good  due  to  the  specific  choice  of  data  but  in  general  it  is 
impossible  to  predict  whether  the  used  formula  is  a  good  estimate.  In  this  paper  the  general  lower  bound  is 
used  as  queue  indicator  without  any  assumptions  about  the  distribution  of  the  basic  data  streams. 

To  test  the  performance  of  the  Heavy  Traffic  limit,  the  loss  rate  has  to  be  evaluated.  The  approach  is  quite 
different  compared  to  the  Large  Deviations  Approximation.  The  key  idea  is  that  the  shape  of  the  curves 
representing  the  buffer  overflow  probability  and  the  loss  rate  arc  similar  [16].  If  there  is  a  good  estimate  of 
the  tail  probability  an  a  way  to  calculate  LR(A)  then  LR(B)  can  be  calculated  as 

“(y)=sif)oPr(e£!')  (25) 

with  the  calculation  of  Pr(<2  <  X )  and  Pr(<2  <  Y)  based  on  previous  equations.  For  LR( 0)  a  convenient 
formula  can  be  obtained 

1  r°°  (c-tf 

LR(0)  =  — ==  /  (z-C)e-A^dz  (26) 

pV27to2  Jc 

where  q  =  E  (A)  the  mean  traffic  rate  of  the  aggregated  traffic  stream  is  and  a2  =  var  (A)  the  variance  of  the 
aggregated  traffic  stream.  The  equation  uses  the  fact  that  if  B  =  0,  t  =  1. 

Again  a  trade-off  between  exact  values  an  ease  of  computation  is  made: 


•  the  number  of  flows  or  the  exact  traffic  mix  are  not  needed; 

•  the  aggregated  traffic  flow  is  assumed  to  be  Gaussian; 

13In  [27]  a  Poisson  Pareto  Burst  Process  (PPBP)  is  used  for  the  modeling  of  Internet  traffic. 
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•  a  straightforward  lower  bound  is  obtained; 

•  only  the  mean  and  the  variance  of  the  summed  aggregate  traffic  stream  have  to  be  calculated  from  a 
traffic  trace; 

•  for  small  values  of  X  the  approach  is  not  accurate; 

•  a  integral  that  can  only  be  solved  analytically  in  a  few  specific  cases,  has  to  be  calculated. 


5.0  MODERATE  DEVIATIONS 


The  estimate  based  on  the  Central  Limit  Theorem  is  appealing  because  it  leads  to  parsimonious  models,  i.e. 
all  that  is  needed  to  know  about  an  arrival  process  is  its  mean  an  covariance  structure.  Large  Deviations  says 
integrals  can  be  replaced  by  a  supremum.  Both  approaches  can  be  analytically  intractable:  the  first  because 
integrals  are  needed  to  calculate  probabilities  and  the  second  because  the  full  statistical  characteristics  of  a 
traffic  flow  are  unwieldy.  It  is  tempting  to  combine  the  two  techniques  [26]:  by  the  Central  Limit  Theorem 
the  arrival  process  can  be  modeled  as  a  Gaussian  process;  by  Large  Deviations  theory,  it  is  sufficient  to  study 
the  most  likely  paths. 

Several  theories  can  be  applied  to  get  the  same  equation: 


•  Most  Probable  Path  [1]:  application  of  Schindler’s  theorem,  a  Gaussian  Large  Deviations  result; 

•  Moderate  Deviations  [26]:  application  of  Gartner-Ellis  theorem,  also  a  Large  Deviations  result,  to 
general  traffic  streams; 

•  Maximum  Variance  Asymptotic  based  on  Extreme  Value  theory  [7]:  with  key  idea  that  the  local  be¬ 
haviour  of  the  normalized  variance  around  the  index  t  where  the  maximum  variance  is  achieved,  is 
found  to  essentially  determine  the  supremum  definition. 


The  approach  in  [26]  quantifies  with  a  burstiness  parameter  [3  the  relation  with  both  the  Central  Limit  The¬ 
orem  (p  =  0)  and  the  Large  Deviations  theory  (p  =  I )  .  Moderate  Deviations  concerns  a  range  of  scales 
between  these.  A  typical  traffic  flows  has  bursts  at  all  scales  0  <  P  <  1,  with  larger  bursts  (large  p)  less 
frequently  than  smaller  bursts  (small  p).  The  parameter  P  plays  the  dual  role:  it  measures  both  burst  size  and 
burst  frequency. 

Let  N  be  the  number  of  stationary,  identically  distributed  arrival  processes  A„  with  mean  rate  ft,  S,  = 
£|j=_(+1A„  where  .S'o  =  0  the  cumulative  arrival  process,  C  the  transmission  capacity  and  X  the  number 
of  traffic  units  in  the  buffer.  The  following  equation  can  be  found 


1 

7VP 


logPr 


=  -  inf 

f€N 


(. x  +  ct )2 

2  of 


where  x  =  X/N~^ ,  c=  (C— ftN) /jVt  and  aj  =  var(Sf^).  If  p  =  0,  the  probability  becomes  Pr  ( V  (qA7 ,  d;2  V) ) 
with  an  exponential  approximations  and  if  P  =  1,  the  result  resembles  a  Large  Deviations  result  for  a  Gaus¬ 
sian  distribution14. 

14For  a  Gaussian  arrival  process,  A f(s)  =  s/^t  +  and  the  result  follows  directly  by  taking  the  supremum  st  = 

arg  supigR+  (s(x  +  ct)  -  Af(  l)  (s)j . 
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A  straightforward  estimate  can  be  found  using  this  equation 


1  X+(C-,,)!)2 

Pr  (2>X)«e  2a?  (27) 

where  t  =  sup ai'g/r;Na2 /(X  +  (C  —  ,u)t)2  and  aj  =  N&j.  This  result  is  independent  of  N  and  p.  It  has  to  be 
clear  that  this  approximation  is  only  valid,  if  the  input  process,  C  and  X  stand  in  a  certain  relation  but  of 
course  the  estimate  can  always  be  calculated. 

As  is  the  case  for  Large  Deviations,  some  assumptions  have  to  be  made  to  control  the  tail  of  the  distribution. 
The  details  can  be  found  in  [1 1].  An  important  result  is  that  the  full  covariance  structure  y  can  be  recovered 
from  the  marginal  variances  <5j. 


To  =  c>7 

Yi  =  ^(02-0?) 

It  =  -  (a2+1  -  2a2  +  a2_ , )  for  t>l 

In  other  words,  the  marginal  distributions  of  the  cumulative  arrival  process  fully  characterize  the  process. 
The  same  refinement  as  for  the  Large  Deviations  estimate  can  be  applied  [26] 

Pr (Q  >  X)  «  ;  1  e~/(z)  (28) 

where  I(X)  =  and  p  =  juN  is  the  mean  traffic  rate  of  the  aggregated  traffic  stream. 

itjr 

The  scale  procedure  based  on  Pr(<2  >  0)  ~  2Pr(A  >  C)  can  be  used  to  correct  the  buffer  flow  probability 
[1]- 

The  loss  rate  can  be  found  as  for  the  Large  Deviations  case 

LR  = - 1  e~/(B)  (29) 

sipv/4nI(B) 

or  the  same  procedure  as  for  the  Central  Limit  estimate  can  be  used  [16]  with  or  without  the  refinement. 

This  is  an  estimate  for  an  isolated  queue.  For  a  network  of  queues  the  parameter  p  can  be  interpreted  as 
a  low-pass  filter  [26].  The  loss  probability  at  a  queue  of  scale  P  will  be  of  the  order  of  L  P.  Thus  the  loss 
probability  for  a  flow  through  a  network  will  be  dominated  by  the  loss  probability  at  the  queue  along  the  path 
with  the  smallest  scale  Pmin.  This  can  be  called  the  bottleneck  link  of  the  flow.  By  the  low-pass  filter  result, 
traffic  is  essentially  unchanged  at  scales  less  than  or  equal  to  pmm  until  it  reaches  the  bottleneck  link.  The 
approximation  for  loss  rate  can  be  used  at  the  bottleneck  link  without  taking  into  account  any  smoothing. 
The  buffer  flow  probability  or  loss  rate  of  a  flow  through  a  network  can  be  approximated  by  estimating  the 
buffer  flow  probability  at  the  bottleneck  link. 

The  Maximum  Variance  Asymptotic  finds  the  same  equation  of  the  buffer  flow  probability  for  a  general 
class  of  Gaussian  processes  with  stationary  increments  [4],  including  a  large  class  of  long-range  dependent 
processes  with  a  =  lim,_>0dog(a2)/log(t),  assuming  that  this  limit  exists,  a  can  not  be  greater  than  2 
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from  the  stationary  increment  property.  The  interval  a  £  [0, 2]  covers  the  majority  of  non-trivial  Gaussian 
processes  with  stationary  increments.  Following  conditions  have  to  be  respected: 


linif^ocT^  =  a 

(30) 

d2a2 

lim =  a 

(31) 

a2  ~  Kta 

for  some  K  >  0 

(32) 

lim  sup nof  <  -°o 

for  some  a  £  (0,  a) 

(33) 

The  first  two  conditions  are  a  direct  result  of  the  definition  of  a.  The  third  condition  is  closely  related  to  the 
self-similarity  of  St15.  The  last  condition  is  about  the  behaviour  of  <jj  around  t  =  0,  and  it  is  satisfied  if  of 
decreases  as  fast,  or  faster  than  ta  for  some  positive  a  as  t  |  0. 

For  sufficiently  large  X,  log  (g2/(X  +  (C  — p)t)2)  is  strictly  concave  on  [(a  — y/a/2)X/ (2  — a) (C—fj),  (a  + 
yj a/2)X/(2  —  a)(C  —  jti)],  and  there  is  a  unique  index  t  where  it  attains  its  maximum  [6].  The  buffer  flow 
probability  and  t  can  be  computed  by  performing  a  simple  local  search  algorithm  starting  at  aX  /  (2  —  a )  (C  — 
p) .  Even  for  fairly  small  numbers  of  X  this  can  be  used  because  g2/(X  +  (C  —  fS)t )2  is  usually  of  a  distinctly 
uni-modal  shape.  For  small  X  the  minimizer  may  not  be  unique.  The  formula  for  the  calculation  of  Pr(<2  <  X ) 
however  does  not  depend  on  the  uniqueness  of  t. 

2 

The  upper  bound  for  the  estimate  from  the  Central  Limit  Theorem  — ^  w  “ 1  e  2  is  exponentially  the  same 
as  the  result  from  the  Moderate  Deviations  scaling.  The  prefactor  varies  much  slower  than  the 

exponential  part.  The  fact  that  for  sufficiently  large  X,  log  (a2 /(X  +  (C  —  /u)t )2)  is  strictly  concave,  confirms 
and  strengthens  the  reasoning  to  obtain  the  lower  bound  in  the  Central  Limit  Theorem  approach. 

It  seems  that  the  Moderate  Deviations  estimate  can  be  the  best  of  both  worlds: 

•  the  number  of  flows  or  the  exact  traffic  mix  are  not  needed; 

•  the  aggregated  traffic  flow  is  assumed  to  be  Gaussian; 

•  the  probability  is  approximated  with  an  exponential; 

•  a  refined  estimate  based  on  the  Bahadur-Rao  theorem  and  Petrov’s  theorem  can  be  used  and  a  smart 
rescale  can  be  applied; 

•  the  interpretation  of  the  burst  parameter  P  as  a  low-pass  filter  makes  the  calculation  of  the  loss  rate  in 
a  network  straightforward; 

•  only  the  mean  and  the  variance  of  the  summed  aggregate  traffic  stream  have  to  be  calculated  from  a 
traffic  trace; 

•  it  is  not  directly  clear  if  a  traffic  stream  has  the  right  scaling  behaviour  for  a  specific  queue. 


6.0  NUMERICAL  RESULTS 

To  validate  the  accuracy  of  the  formulas  (15,  27,  23),  both  the  BOP  and  the  LR ,  both  expressed  in  log  10,  are 
compared  with  simulations.  The  reported  results  have  a  maximum  traffic  unit  loss  probability  of  the  order 

15  The  Hurst  parameter  H  is  directly  related  to  the  parameter  a  by  2 H  =  a.  The  first  two  condition  can  be  used  to  obtain  an 
estimate  for  H. 
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10  6  to  obtain  a  reasonable  confidence  in  simulations,  expressed  as  an  95%  confidence  interval.  Importance 
sampling  needs  to  be  used  to  get  results  with  a  lower  traffic  unit  loss  rate.  The  plots  in  this  paper  are  only 
a  small  sample  of  the  experiments  that  are  conducted  during  the  research  for  an  optimal  background  fluid 
model. 

The  first  experiment  considers  the  multiplexing  of  periodic  on-off  sources  with  a  period  of  30  peak  rates 
denoted  by  K; ,  busy  periods  by  on;  and  composition  by  n,.  The  following  data  are  used 


Source,- 

n  i 

Ki 

on ; 

1 

10 N 

10 

3 

2 

4 N 

1 

10 

The  influence  of  N,  C  and  B  are  respectively  shown  in  the  figures  1,  2,  3,  4,  5  and  6.  The  parameters  for  N, 
C  and  B  are  visible  in  the  caption  of  each  figure. 

The  BOP  is  bounded  by  the  LD  as  upper  bound  and  the  CL  as  lower  bound.  The  Large  Deviations  Bahadur 
Rao  refinement  fits  the  simulated  probability  for  all  values  of  x  when  the  load  is  moderated.  For  a  high  load, 
the  CL  estimate  without  rescaling  approximates  very  well  the  simulated  buffer  occupation  probability. 

The  MD  and  MD  BR  estimates  for  both  the  BOP  as  the  CLR  are  an  approximation  first  for  respectively 
LD  and  LD  BR  and  if  N,  c  or  b  increases  for  respectively  CL  and  CL  scaled.  The  Gaussian  approximation 

becomes  less  suitable  if  the  inverse  of  the  normalized  variance - — —  or  the  space-scale  s  arc  high.  This 

(B+(C-Ot)2  F  6 

is  a  logical  result.  The  Central  Limit  theorem  deals  with  fluctuations  of  size  0(1/ \/N) ,  but  the  fluctuations 
for  a  low  load  can  be  much  higher  0(1)  and  can  be  calculated  by  the  Large  Deviations  theory.  A  high  value 
of  the  space-scale  s  indicates  less  multiplexing  and  a  value  of  the  effective  bandwidth  closer  to  the  peak  rate. 
Using  the  values  of  a  Gaussian  process  in  the  cumulant  generating  the  normalized  variance  and  the  space 
scale  are  linked16. 

The  scaled  versions  of  the  Central  Limit  approximation  and  the  Moderate  Deviations  estimate  don’t  give 
consistent  results  when  the  load  changes.  These  two  methods  arc  only  acceptable  for  the  calculation  of  the 
buffer  occupation  distribution  in  a  limited  range  of  values  for  c  and  x. 

In  all  the  results,  the  characteristic  time-scale  t  identical  is  almost  identical.  This  allows  to  reduce  the 
expensive  linear  search  for  LD  estimates  by  searching  around  the  values  of  t  found  for  the  MD  or  CL 
approximation. 

A  second  experiment  is  the  multiplexing  of  100  MPEG-1  encoded  Star  wars  Race  files  [23].  The  variation  of 
BOP  and  LR  in  function  of  b  are  represented  in  the  figures  7  and  8  with  a  transmission  capacity  of  respectively 
30  Mbits  and  40  Mbits.  The  load  is  for  the  former  87.5%  and  for  the  latter  65.6%. 

Both  plots  are  very  different  and  confirm  the  results  from  the  multiplexing  of  periodic  on-off  sources.  In 
regions  with  a  high  normalized  variance,  the  MD  scaled  estimate  gives  an  appropriate  upper  bound  while  in 
regions  with  a  low  load  the  LD  BR  estimate  approaches  the  values  of  the  simulation. 

The  plots  show  clearly  that  buffering  is  a  limited  solution  for  a  heavily  loaded  link.  The  real  answer  to  the 
problem  is  increasing  the  transmission  capacity,  the  buffer  can  only  contain  a  limited  burst. 

In  the  last  experiment,  the  Bellcore  trace  file  [17]  is  used  to  demonstrated  the  multiplexing  of  an  unknown 
number  of  traffic  sources  where  the  individual  distributions  are  not  available.  The  BOP  is  calculated  in 
function  of  B.  C  =  2  Mbits  in  the  first  figure  9  and  C  =  4  Mbits  in  the  second  10. 

The  results  are  similar  as  the  previous  and  indicate  the  limits  of  the  applicability  of  a  Gaussian  approximation 
for  aggregated  traffic. 

16For  a  Gaussian  arrival  process,  A,(s)  =  s/jt+  ^.r2Cj/,  st  =  argsupJgR+  ( s(B  +  Ct )  —  A,(j))  =  ^  • 
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7.0  CONCLUSION 

In  this  paper,  the  buffer  flow  probability  for  a  single  FIFO  queue  is  estimated  by  different  formulas  ranging 
from  a  central  limit  approach  to  a  large  deviations  limit.  The  approximations  are  bounded  by  LD  as  an 
upper  bound  and  CL  as  an  lower  bound.  For  regions  with  a  high  normalized  variance  depending  on  the 
characteristic  time-scale  f,  the  difference  between  the  approximations  is  small.  For  low  load  ranges  with 
a  smaller  degree  of  multiplexing,  Sf  large,  only  the  LD  BR  gives  an  accurate  estimate.  This  is  shown  by 
event-driven  simulations  of  both  synthetic  and  real  traffic  sources. 

A  appropriate  designed  network  has  a  low  LR  for  each  queue  in  regime.  If  the  loss  rate  is  too  high,  the 
distribution  of  the  traffic  flow  is  altered  and  a  fluid  based  approach  becomes  very  difficult.  The  LD  estimates 
are  useful  when  the  loss  rate  is  limited,  i.e.  a  high  normalized  variance.  The  computational  expensive  search 
for  the  characteristic  time-scale  t  can  be  reduced  by  performing  a  search  around  the  values  found  for  the  MD 
limit.  In  the  case  of  a  high  load  the  CL  approximation  can  be  used,  knowing  that  the  nice  features  of  LD, 
decoupling  without  smoothing  and  the  effective  bandwidth  interpretation,  are  no  longer  supported. 

The  performance  of  these  formulas  can  also  be  measured  in  a  real  hybrid  fluid-flow  event-driven  simula¬ 
tion  compared  to  a  event-driven  simulation  of  both  background  an  foreground  baffle.  The  loss  rate  for  the 
foreground  traffic  stream  is  however  so  small  that  importance  sampling  is  needed.  The  most  common  impor¬ 
tance  sampling  methods  for  networks  are  based  on  Large  Deviations  results  thus  favoring  the  LD  estimates. 
Nevertheless  the  simulations  can  be  refined  and  speeded  up  to  obtain  tighter  limits  for  the  performance  of 
the  approximations.  This  will  be  investigated  in  a  future  work. 

Topics  for  further  research  also  includes: 

•  programming  a  real  hybrid  fluid-flow  event-driven  simulation  with  the  LD  BR  estimate  as  background 
descriptor; 

•  estimating  the  burst  parameter  p  to  find  the  bottleneck  node  and  adapting,  smoothing,  the  fluid-flows 
after  this  node  in  a  hybrid  simulation  with  a  high  load  scenario; 

•  extending  the  FIFO  queue  to  priority  queueing  and  generalized  processor  sharing; 

•  extending  the  model  to  include  wireless  base-stations  and  ad-hoc  networks. 

A  fluid  flow  buffer  indicator  can  be  found  when  many  baffle  sources  arc  aggregated  but  the  performance 
measures  of  the  resulted  traffic  stream  have  not  always  a  Gaussian  behaviour.  The  degree  of  multiplexing 
plays  an  important  role  and  determines  whether  a  Central  Limit  or  a  Large  Deviations  estimate  is  applicable. 
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ORGANIZATION 


Figure  1 :  Variation  of  BOP  as  a  function  of  N  for  the  multiplexing  of  two  periodic  on/off  sources  where  C=12N  and  B=5N. 


Figure  2:  Variation  of  LR  as  a  function  of  N  for  the  multiplexing  of  two  periodic  on/off  sources  where  C=12N  and  B=5N. 
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Figure  3:  Variation  of  BOP  as  a  function  of  c  for  the  multiplexing  of  two  periodic  on/off  sources  where  N=80  and  B=2N. 


Figure  4:  Variation  of  LR  as  a  function  of  c  for  the  multiplexing  of  two  periodic  on/off  sources  where  N=80  and  B=2N. 
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ORGANIZATION 


Figure  5:  Variation  of  BOP  as  a  function  of  b  for  the  multiplexing  of  two  periodic  on/off  sources  where  N=80  and  C=12N. 


Figure  6:  Variation  of  LR  as  a  function  of  b  for  the  multiplexing  of  two  periodic  on/off  sources  where  N=80  and  C=12N. 
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Figure  7:  Variation  of  BOP  as  a  function  of  b  for  the  multiplexing  of  MPEG  encoded  Star  wars  traces  where  N=100  and  C=30 

Mbits. 


Variation  of  BOP  as  a  function  of  b 


Figure  8:  Variation  of  BOP  as  a  function  of  b  for  the  multiplexing  of  MPEG  encoded  Star  wars  traces  where  N=100  and  C=40 

Mbits. 
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Figure  9:  Variation  of  BOP  as  a  function  of  B  for  the  Bellcore  trace  where  C=2  Mbits. 


Variation  of  BOP  as  a  function  of  B 
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Figure  10:  Variation  of  BOP  as  a  function  of  B  for  the  Bellcore  trace  where  C=4  Mbits. 
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■  B  is  Buffer  Size  in  traffic  units 
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■  Background  Buffer  Occupation  Probability  can  be  estimated 

■  Small  values  of  X:  Central  Limit  Theorem 

■  Larger  Values  of  X:  Large  Deviations  Asymptotic: 

■  At(s)  is  convex  in  s 

■  Bound  for  t  based  on  Central  Limit  Theorem 

■  Moderate  Deviations  Scaling  is  NOT  the  Best  of  both  Models 
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Future  work 


■  Validation  Hybrid  Model:  Importance  Sampling  for  small 
Probabilities 

■  Other  Queueing  Models:  Priority  Queueing,  Generalized 
Processor  Sharing 

■  Smoothing  and  Loss  of  Traffic  for  small  values  of  X  in  function 
of  Burst  parameter 

■  Extension  to  Wireless  Networks 


<  g  ►  <!►  <!►  -1  'O  0,0 


Conclusion 


Questions 


Characterization 
of  Background 
Traffic 
in  Hybrid 
Simulation 

Ben  Lauwens, 
RMA 


Hybrid  Network 
Simulation 
Network  Simulation 
Hybrid  Simulation 

Background 
Traffic  Model 
Lindley’s  Recursion 
Buffer  Occupation 
Distribution 
Large  Deviations 
Asymptotic 
Central  Limit 
Approximation 
Moderate  Deviations 


Contact 


■  E-mail:  Ben.Lauwens@rma.ac.be 

■  Phone:  +32  2  742  6627 


Numerical  Results 

Conclusion 


3 


'0<\0 


